NSF PAR Search | NSF Public Access Repository


Stealthy Backdoor Attack in Federated Learning via Adaptive Layer-wise Gradient Alignment

Yang, Qingqian; Yan, Peishen; Wu, Xiaoyu; Zhang, Jiaru; Song, Tao; Hua, Yang; Wang, Hao; Wang, Liangliang; Guan, Haibing (October 2025, International Conference on Computer Vision (ICCV) 2025)

Free, publicly-accessible full text available October 22, 2026
AMPERE: A Generic Energy Estimation Approach for On-Device Training

https://doi.org/10.1145/3764944.3764951

Zhang, Jiaru; Wang, Zesong; Wang, Hao; Song, Tao; Su, Huai-an; Chen, Rui; Hua, Yang; Zhou, Xiangwei; Ma, Ruhui; Pan, Miao; et al (August 2025, ACM SIGMETRICS Performance Evaluation Review)

Battery-powered mobile devices (e.g., smartphones, AR/VR glasses, and various IoT devices) are increasingly being used for AI training due to their growing computational power and easy access to valuable, diverse, and real-time data. On-device training is highly energy-intensive, making accurate energy consumption estimation crucial for effective job scheduling and sustainable AI. However, the heterogeneity of devices and the complexity of models challenge the accuracy and generalizability of existing methods. This paper proposes AMPERE, a generic approach for energy consumption estimation in deep neural network (DNN) training. First, we examine the layer-wise energy additivity property of DNNs and strategically partition the entire model into layers for fine-grained energy consumption profiling. Then, we fit Gaussian Process (GP) models to learn from layer-wise energy consumption measurements and estimate a DNN's overall energy consumption based on its layer-wise energy additivity property. We conduct extensive experiments with various types of models across different real-world platforms. The results demonstrate that AMPERE reduces the Mean Absolute Percentage Error (MAPE) by up to 30%. Moreover, we apply AMPERE to guide energy-aware pruning, reducing energy consumption by 50% and further demonstrating its generality and potential.
Free, publicly-accessible full text available August 26, 2026
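Illustrative note on the AMPERE record above: the abstract relies on two ideas, layer-wise energy additivity (a model's energy is the sum of its layers' energy) and MAPE as the error metric. The minimal Python sketch below shows both. It is not the authors' implementation: the `Layer` description, the `toy_layer_estimator` function, and the `JOULES_PER_UNIT` coefficients are hypothetical stand-ins, and a simple linear rule is used in place of the Gaussian Process models that the abstract describes.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical layer description: (layer type, size parameter).
Layer = Tuple[str, float]


def estimate_model_energy(layers: Sequence[Layer],
                          layer_estimator: Callable[[Layer], float]) -> float:
    """Layer-wise additivity: model energy is the sum of per-layer estimates."""
    return sum(layer_estimator(layer) for layer in layers)


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean Absolute Percentage Error, in percent."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(a - p) / abs(a) for a, p in zip(actual, predicted))
    return 100.0 * total / len(actual)


# Stand-in estimator (assumption): energy grows linearly with layer size.
# The abstract instead fits Gaussian Process models to measured layer energy.
JOULES_PER_UNIT = {"conv": 0.5, "linear": 0.25}  # made-up coefficients


def toy_layer_estimator(layer: Layer) -> float:
    layer_type, size = layer
    return JOULES_PER_UNIT[layer_type] * size


# A made-up three-layer model; the estimate is the sum over its layers.
model = [("conv", 8.0), ("conv", 4.0), ("linear", 2.0)]
predicted_total = estimate_model_energy(model, toy_layer_estimator)

# MAPE over two made-up (measured, predicted) energy pairs.
example_error = mape([100.0, 200.0], [90.0, 220.0])
```

Any per-layer estimator with the same signature, including a fitted regression model, could be passed to `estimate_model_energy` in place of the toy rule.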
SKYMASK: Attack-Agnostic Robust Federated Learning with Fine-Grained Learnable Masks

Yan, Peishen; Wang, Hao; Song, Tao; Hua, Yang; Ma, Ruhui; Hu, Ningxin; Haghighat, Mohammad Reza; Guan, Haibing (September 2024, Springer Cham)

Full Text Available
Backdoor Federated Learning by Poisoning Backdoor-Critical Layers

Zhuang, Haomin; Yu, Mingxian; Wang, Hao; Hua, Yang; Li, Jian; Yuan, Xu (May 2024, The 12th International Conference on Learning Representations (ICLR))

Federated learning (FL) has been widely deployed to enable machine learning training on sensitive data across distributed devices. However, the decentralized learning paradigm and heterogeneity of FL further extend the attack surface for backdoor attacks. Existing FL attack and defense methodologies typically focus on the whole model. None of them recognizes the existence of backdoor-critical (BC) layers, a small subset of layers that dominate the model vulnerabilities. Attacking the BC layers achieves effects equivalent to attacking the whole model, but with a far smaller chance of being detected by state-of-the-art (SOTA) defenses. This paper proposes a general in-situ approach that identifies and verifies BC layers from the perspective of attackers. Based on the identified BC layers, we carefully craft a new backdoor attack methodology that adaptively seeks a fundamental balance between attacking effects and stealthiness under various defense strategies. Extensive experiments show that our BC layer-aware backdoor attacks can successfully backdoor FL under seven SOTA defenses with only 10% malicious clients and outperform the latest backdoor attack methods.
Full Text Available
Cheaper and Faster: Distributed Deep Reinforcement Learning with Serverless Computing

https://doi.org/10.1609/aaai.v38i15.29592

Yu, Hanfei; Li, Jian; Hua, Yang; Yuan, Xu; Wang, Hao (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Deep reinforcement learning (DRL) has achieved immense success in many applications, including gaming AI, robotics, and system scheduling. Many distributed algorithms and architectures (e.g., the actor-learner architecture) have been proposed to accelerate DRL training with large-scale server-based clusters. However, training on-policy algorithms with the actor-learner architecture unavoidably wastes resources due to synchronization between learners and actors, resulting in significant extra cost. As a promising alternative, serverless computing naturally fits on-policy synchronization and alleviates resource waste in distributed DRL training with pay-as-you-go pricing. Yet, no prior work has leveraged serverless computing to facilitate DRL training. This paper proposes MinionsRL, the first serverless distributed DRL training framework that aims to improve both the training efficiency and the cost efficiency of DRL with dynamic actor scaling. We prototype MinionsRL on top of Microsoft Azure Container Instances and evaluate it with popular DRL tasks from OpenAI Gym. Extensive experiments show that MinionsRL reduces total training time by up to 52% and training cost by 86% compared to the latest solutions.
Full Text Available
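Illustrative note on the MinionsRL record above: the abstract argues that synchronized on-policy training leaves always-on actors idle while the learner updates, and that pay-as-you-go billing avoids paying for that idle time. The toy Python cost model below sketches that argument only. It is not MinionsRL's cost model and does not reflect Azure pricing: the function names, the assumption that every round has a fixed sampling phase followed by a learning phase, and all numbers are hypothetical.

```python
def always_on_cost(num_actors: int, rounds: int, sample_s: float,
                   learn_s: float, price_per_actor_s: float) -> float:
    """Actors are billed for every second of each round, including the
    time they wait idle while the learner updates the policy."""
    return num_actors * rounds * (sample_s + learn_s) * price_per_actor_s


def pay_as_you_go_cost(num_actors: int, rounds: int, sample_s: float,
                       price_per_actor_s: float) -> float:
    """Actors are billed only while they are sampling."""
    return num_actors * rounds * sample_s * price_per_actor_s


# Made-up workload: 4 actors, 10 rounds, 3 s sampling and 1 s learning per round.
fixed = always_on_cost(4, 10, sample_s=3.0, learn_s=1.0, price_per_actor_s=0.5)
on_demand = pay_as_you_go_cost(4, 10, sample_s=3.0, price_per_actor_s=0.5)

# Fraction of the always-on actor bill that pays for idle waiting.
idle_share = (fixed - on_demand) / fixed
```

In this toy setting the idle share equals `learn_s / (sample_s + learn_s)`, so the longer the learner's update takes relative to sampling, the more an always-on deployment pays for waiting actors.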
